Detection of Difference between News Articles on the Same Topic Based on Sequential Comparison
نویسندگان
چکیده
Currently, a lot of news articles are published on theWeb, and it is getting easier for us to read them. However, the number of articles are too large for us to read all of them. Although some Web sites cluster/classify news articles into some topics (categories), it is not enough since a large number of articles are still in each topic. Detecting difference between articles on one topic will be one of the solution to comprehend the whole topic. In this paper, we propose a method for detection of difference between news articles on the same topic. Articles are sequentially compared by three different comparison units: paragraphs, sentences, and simple sentences. Our method is evaluated by applying it to Japanese news articles.
منابع مشابه
A New Document Embedding Method for News Classification
Abstract- Text classification is one of the main tasks of natural language processing (NLP). In this task, documents are classified into pre-defined categories. There is lots of news spreading on the web. A text classifier can categorize news automatically and this facilitates and accelerates access to the news. The first step in text classification is to represent documents in a suitable way t...
متن کاملFair News Reader: Recommending News Articles with Different Sentiments Based on User Preference
We have developed a news portal site called Fair News Reader (FNR) that recommends news articles with different sentiments for a user in each of the topics in which the user is interested. FNR can detect various sentiments of news articles, and determine the sentimetal preferences of a user based on the sentiments of previously read articles by the user. While there are many news portal sites o...
متن کاملArabic News Articles Classification Using Vectorized-Cosine Based on Seed Documents
Besides for its own merits, text classification (TC) has become a cornerstone in many applications. Work presented here is part of and a pre-requisite for a project we have overtaken to create a corpus for the Arabic text process. It is an attempt to create modules automatically that would help speed up the process of classification for any text categorization task. It also serves as a tool for...
متن کاملContrastive Analysis of Political News Headlines Translation According to Berman’s Deformative Forces
The present research aimed at investigating the deformation of political news headlines translation between English and Persian News Agencies based on Berman`s deformative system. For this purpose, 100 news headlines in English were selected from BBC, Reuters, Associated Press, France, France 24, Financial Times, Business Times, New York Times, Politico, Guardian, CNN, Bloomberg, Middle East Ey...
متن کاملConsistency of textual expression in newspaper articles: an argument for semantically based query expansion
This article investigates how consistent different newspapers are in their choice of words when writing about the same news events. News articles on the same news events were taken from three Finnish newspapers and compared in regard to their central concepts and words representing the concepts in the news texts. Consistency figures were calculated for each set of three articles (the total numb...
متن کامل